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Remarks/Arguments: 

The applicants respectfully request that, before formally examining the 
application identified above, the examiner accept the enclosed substitute specification under 37 
C.F.R. § 1.125. The applicants believe that the substitute specification will facilitate processing 
of the application in accordance with M.P.E.P. § 608.01(q) and assert that it complies with 37 
C.F.R. § 1.52. Although it makes no substantive changes, the substitute specification is 
submitted to conform this case to the formal requirements and long-established formal 
standards of U.S. Patent and Trademark Office practice, and to provide improved idiom and 
better grammatical form. 

Also enclosed and submitted is a marked-up copy which shows the portions of the 
original specification which are being added and deleted. 

STATEMENT 

The undersigned hereby states that the enclosed substitute specification contains 
no new matter and includes the same changes as are indicated in the marked-up copy of the 
original specification showing additions and deletions. 

By this preliminary amendment, the specification and the claims have been 
amended to place the application in better condition for examination and allowance. Claims 1- 
10 have been cancelled and claims 11-20 have been added to secure enhanced protection for 
the applicants' invention. These newly added claims find support in the instant specification 
and claims as filed. There is no issue of new matter. Entry of this preliminary amendment and 
issuance of a first action on the merits are requested. 

The foregoing amendments were not made for reasons related to patentability. 
Rather, the amendments were made to facilitate prosecution of the instant application. This 
amendment is made without disclaimer or prejudice to applicants' right to seek, in a derivative 
or related application, claims to any subject matter disclosed but not claimed in the present 
application. 
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INTEGRIN LIGAND, HUMAN MINDIN 

This application is a continuation of U.S. Application No. 10/324,981, filed 
December 20, 2002, which is a continuation of U.S. Application No. 10/117,166, filed 
5 April 5, 2002. which is a continuation of U.S. Application No. 09/963,266. filed September 
26. 2001, which is a continuation of U.S. Application No. 09/906,210, filed July 16. 2001, 
which is a continuation of U.S. Application No. 09/403,558, filed October 25, 1999, which 
is a U.S. National Phase application of PCT Serial No. PCT/US98/09476, filed May 7, 
1998, which claims th e b e n e fit prioritv benefits of U.S. Provisional Application No. 
10 60/046,106, filed 

May 9, 1997 t; each of whose contents are incorporated herein by reference in their 
entirety. 

Field of the Invention 

15 This invention relates to newly identified polypeptides and polynucleotides 

encoding such polypeptides, to their use in therapy and in identifying compounds which 
may be agonists, antagonists and/or inhibitors which are potentially useful in therapy, 
and to production of such polypeptides and polynucleotides. 

20 Background of the Invention 

The drug discovery process is currently undergoing a fundamental revolution as it 
embraces 'functional genomics', that is, high throughput genome- or gene-based biology. 
This approach is rapidly superceding earlier approaches based on 'positional cloning'. A 
phenotype, that is a biological function or genetic disease, would be identified and this 
25 would then be tracked back to the responsible gene, based on its genetic map position. 

Functional genomics relies heavily on the various tools of bioinformatics to identify 
gene sequences of potential interest from the many molecular biology databases now 
available. There is a continuing need to identify and characterise further genes and their 
related polypeptides/proteins, as targets for drug discovery. 

30 

Summary of the Invention 

The present invention relates to Human Mindin, in particular Human Mindin 
polypeptides and Human Mindin polynucleotides, recombinant materials and methods for 
their production. In another aspect, the invention relates to methods for using such 
35 polypeptides and polynucleotides, including the treatment of angiogenic diseases (cancer, 
cancer metastasis, chronic inflammatory disorders, rheumatoid arthritis, atherosclerosis, 
macular degeneration, diabetic retinopathy), restenosis, Alzheimer's disease, neural 
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disorders and tissue remodeling, hereinafter referred to as "the Diseases", amongst 
others. In a further aspect, the invention relates to methods for identifying agonists 
and antagonists/inhibitors using the materials provided by the invention, and treating 
conditions associated with Human Mindin imbalance with the identified compounds. In a 
5 still further aspect, the invention relates to diagnostic assays for detecting diseases 
associated with inappropriate Human Mindin activity or levels. 

Description of the Invention 

In a first aspect, the present invention relates to Human Mindin polypeptides. 

10 Such peptides include isolated polypeptides comprising an amino acid sequence which 
has at least 70% identity, preferably at least 80% identity, more preferably at least 
90% identity, yet more preferably at least 95% identity, most preferably at least 97- 
99% identity, to that of SEQ ID NO:2 over the entire length of SEQ ID NO:2. Such 
polypeptides include those comprising the amino acid of SEQ ID NO:2. 

15 Further peptides of the present invention include isolated polypeptides in which 

the amino acid sequence has at least 70% identity, preferably at least 80% identity, 
more preferably at least 90% identity, yet more preferably at least 95% identity, most 
preferably at least 97-99% identity, to the amino acid sequence of SEQ ID NO:2 over 
the entire length of SEQ ID NO:2. Such polypeptides include the polypeptide of SEQ ID 

20 NO:2. 

Further peptides of the present invention include isolated polypeptides encoded 
by a polynucleotide comprising the sequence contained in SEQ ID NO:l. 

Polypeptides of the present invention are believed to be members of the 
thrombospondin family of polypeptides. They are therefore of interest because Human 

25 Mindin is potentially a good therapeutic target for identification of antagonists to treat 
angiogenic diseases (i.e. cancer, cancer metastasis, chronic inflammatory disorders, 
rheumatoid arthritis, atherosclerosis, macular degeneration, and diabetic retinopathy), 
restenosis, Alzheimer's disease, neural disorders and tissue remodeling. These 
properties are hereinafter referred to as "Human Mindin activity" or "Human Mindin 

30 polypeptide activity" or "biological activity of Human Mindin". Also included amongst 
these activities are antigenic and immunogenic activities of said Human Mindin 
polypeptides, in particular the antigenic and immunogenic activities of the polypeptide of 
SEQ ID NO:2. Preferably, a polypeptide of the present invention exhibits at least one 
biological activity of Human Mindin. 

35 Human Mindin is a novel F spondin-like gene which could have a multifunctional 

activity in normal and disease states. F-spondin has amino acid sequences that reveal 
the presence of domains known as thrombospondin (TSP). The homology to the F- 
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spondin would "predict" that Human Mindin could have similar functions such F-spondin. 
F-spondin modulates cell-to-cell and/or cell-to-matrix interaction which is further 
supported by the anti-aggregative effect observed on cultured neuronal cells, aggregation 
of platelets, formation and lysis of fibrin, adhesion and migration of cells and progression 
5 of cells through the growth cycle. F-spondin is implicated as a potential regulator of 
neural cell adhesion, neurite extension and spinal cord differentiation. F-spondin's effect 
on growth, adhesion and guidance of neural cells could also be implicated in tumor 
growth and metastasis. Overexpression of F-spondin could cause "increased or 
suppressed" tumor growth of "neuronal" origin. One of the important functions could be 
10 its ability to bind to integrins, such as aVb3, allbb3 and other unknown integrin 
receptors. 

Integrins are a large family of cell surface receptors that mediate cell-to-cell as 
well as cell-to-matrix adhesion. Integrins recognize the R-G-D sequence in their 
extracellular matrix protein ligand. Human Mindin could be such a novel ligand which 

15 could play an important role in different diseases. Structurally, integrins consist of a 
heterodimer of an a and p chain. Each subunit has a large N-terminal extracellular 
domain followed by a transmembrane domain and a short C-terminal cytoplasmic region. 
Some receptors share a common p chain while having different a chains. The role of 
Human Mindin as an integrin ligand is of great interest due to its potential function in 

20 angiogenesis and in neural growth and differentiation. Numerous angiogenic-related and 
neural-related disorders have been described and the role of F-spondin could be 
implicated in cancer and specifically, cancer metastasis. It has been discovered that 
Human Mindin is "overexpressed" in numerous tissues (e.g., prostate, uterus, peripheral 
blood lymphocytes, mammary gland, bladder, cerebellum, occipital lobe, ovary, heart, 

25 liver, appendix, and others). The homology to the F-spondin could also suggest that 
Human Mindin may have proteolytic functions (proteolyze extracellular matrix or 
basement membrane proteins). The role of Human Mindin as a ligand to the integrin 
receptors with proteolytic activity fits its assigned role in neural growth, differentiation, 
angiogenesis and tissue remodeling. 

30 The polypeptides of the present invention may be in the form of the "mature" 

protein or may be a part of a larger protein such as a fusion protein. It is often 
advantageous to include an additional amino acid sequence which contains secretory or 
leader sequences, pro-sequences, sequences which aid in purification such as multiple 
histidine residues, or an additional sequence for stability during recombinant 

35 production. 

The present invention also includes include variants of the aforementioned 
polypeptides, that is polypeptides that vary from the referents by conservative amino acid 
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substitutions, whereby a residue is substituted by another with like characteristics. 
Typical such substitutions are among Ala, Val, Leu and He; among Ser and Thr; among 
the acidic residues Asp and Glu; among Asn and Gin; and among the basic residues Lys 
and Arg; or aromatic residues Phe and Tyr. Particularly preferred are variants in which 
5 several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids are substituted, deleted, or added in any 
combination. 

Polypeptides of the present invention can be prepared in any suitable manner. 
Such polypeptides include isolated naturally occurring polypeptides, recombinantly 
produced polypeptides, synthetically produced polypeptides, or polypeptides produced by 

10 a combination of these methods. Means for preparing such polypeptides are well 
understood in the art. 

In a further aspect, the present invention relates to Human Mindin 
polynucleotides. Such polynucleotides include isolated polynucleotides comprising a 
nucleotide sequence encoding a polypeptide which has at least 70% identity, preferably 

15 at least 80% identity, more preferably at least 90% identity, yet more preferably at 

least 95% identity, to the amino acid sequence of SEQ ID NO:2, over the entire length of 
SEQ ID NO:2. In this regard, polypeptides which have at least 97% identity are highly 
preferred, whilst those with at least 98-99% identity are more highly preferred, and those 
with at least 99% identity are most highly preferred. Such polynucleotides include a 

20 polynucleotide comprising the nucleotide sequence contained in SEQ ID NO:l encoding 
the polypeptide of SEQ ID NO:2. 

Further polynucleotides of the present invention include isolated polynucleotides 
comprising a nucleotide sequence that has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% 

25 identity, to a nucleotide sequence encoding a polypeptide of SEQ ID NO: 2, over the entire 
coding region. In this regard, polynucleotides which have at least 97% identity are highly 
preferred, whilst those with at least 98-99% identity are more highly preferred, and those 
with at least 99% identity are most highly preferred. 

Further polynucleotides of the present invention include isolated polynucleotides 

30 comprising a nucleotide sequence which has at least 70% identity, preferably at least 
80% identity, more preferably at least 90% identity, yet more preferably at least 95% 
identity, to SEQ ID NO:l over the entire length of SEQ ID NO:l. In this regard, 
polynucleotides which have at least 97% identity are highly preferred, whilst those with 
at least 98-99% i^eftti yidentity are more highly preferred, and those with at least 99% 

35 identity are most highly preferred. Such polynucleotides include a polynucleotide 

comprising the polynucleotide of SEQ ID NO:l as well as the polynucleotide of SEQ ID 
NO:l. 
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The invention also provides polynucleotides which are complementary to all the 
above described polynucleotides. 

The nucleotide sequence of SEQ ID NO:l shows homology with Mindin2 of 
Zebrafish (S. Higashijima et al, Dev. Biol. 192: 211-27, 1997). The nucleotide sequence 
5 of SEQ ID NO: 1 is a cDNA sequence and comprises a polypeptide encoding sequence 

(nucleotides 1 to 996) encoding a polypeptide of 331 amino acids, the polypeptide of SEQ 
ID NO;2. The nucleotide sequence encoding the polypeptide of SEQ ID NO:2 may be 
identical to the polypeptide encoding sequence contained in SEQ ID NO:l or it may be 
a sequence other than the one contained in SEQ ID NO:l, which, as a result of the 
10 redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID 
NO: 2. The polypeptide of SEQ ID NO: 2 is structurally related to other proteins of the 
thrombospondin family, having homology and/or structural similarity with Mindin2 of 
Zebrafish (S. Higashijima et al, Dev. Biol. 192: 211-27, 1997). 

Preferred polypeptides and polynucleotides of the present invention are expected 
15 to have, inter alia, similar biological functions/properties to their homologous 
polypeptides and polynucleotides. Furthermore, preferred polypeptides and 
polynucleotides of the present invention have at least one Human Mindin activity. 

The present invention also relates to partial or other polynucleotide and 
polypeptide sequences which were first identified prior to the determination of the 
20 corresponding full length sequences of SEQ ID NO:l and SEQ ID NO:2. 

Accordingly, in a further aspect, the present invention provides for an isolated 
polynucleotide comprising: 

(a) a nucleotide sequence which has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% 

25 identity, even more preferably at least 97-99% identity to SEQ ID NO: 3 over the entire 
length of SEQ ID NO:3; 

(b) a nucleotide sequence which has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% 
identity, even more preferably at least 97-99% identity, to SEQ ID NO: 3 over the entire 

30 length of SEQ ID NO:3; 

(c) the polynucleotide of SEQ ID NO:3; or 

(d) a nucleotide sequence encoding a polypeptide which has at least 70% identity, 
preferably at least 80% identity, more preferably at least 90% identity, yet more 
preferably at least 95% identity, even more preferably at least 97-99% identity, to the 

35 amino acid sequence of SEQ ID NO:4, over the entire length of SEQ ID NO:4; 
as well as the polynucleotide of SEQ ID NO:3. 

The present invention further provides for a polypeptide which: 
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(a) comprises an amino acid sequence which has at least 70% identity, preferably at 
least 80% identity, more preferably at least 90% identity, yet more preferably at least 
95% identity, most preferably at least 97-99% identity, to that of SEQ ID NO:4 over 
the entire length of SEQ ID NO:4; 
5 (b) has an amino acid sequence which is at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% 
identity, most preferably at least 97-99% identity, to the amino acid sequence of SEQ 
ID IMO:4 over the entire length of SEQ ID NO:4; 
(c) comprises the amino acid of SEQ ID NO:4; and 
10 (d) is the polypeptide of SEQ ID NO:4; 

as well as polypeptides encoded by a polynucleotide comprising the sequence contained 
in SEQ ID NO:3. 

The nucleotide sequence of SEQ ID NO: 3 and the peptide sequence encoded 
thereby are derived from EST (Expressed Sequence Tag) sequences. It is 

15 r e cogni se d recoanized by those skilled in the art that there will inevitably be some 

nucleotide sequence reading errors in EST sequences (see Adams, M.D. et a/, Nature 
377 (supp) 3, 1995). Accordingly, the nucleotide sequence of SEQ ID NO:3 and the 
peptide sequence encoded therefrom are therefore sabje esubject to the same inherent 
limitations in sequence accuracy. Furthermore, the peptide sequence encoded by SEQ 

20 ID NO: 3 comprises a region of identity or close homology and/or close structural 
similarity (for example a conservative amino acid difference) with the closest 
homologous or structurally similar protein. 

Polynucleotides of the present invention may be obtained, using standard cloning 
and screening techniques, from a cDNA library derived from mRNA in cells of human 

25 epithelial sarcoma, using the expressed sequence tag (EST) analysis (Adams, M.D., et 
a/. Science (1991) 252:1651-1656; Adams, M.D. etal., Nature, (1992) 355:632-634; 
Adams, M.D., et aL, Nature (1995) 377 Supp:3-174). Polynucleotides of the invention 
can also be obtained from natural sources such as genomic DNA libraries or can be 
synthesized using well-known and commercially available techniques. 

30 When polynucleotides of the present invention are used for the recombinant 

production of polypeptides of the present invention, the polynucleotide may include the 
coding sequence for the mature polypeptide, by itself; or the coding sequence for the 
mature polypeptide in reading frame with other coding sequences, such as those 
encoding a leader or secretory sequence, a pre-, or pro- or prepro- protein sequence, or 

35 other fusion peptide portions. For example, a marker sequence which facilitates 

purification of the fused polypeptide can be encoded. In certain preferred embodiments 
of this aspect of the invention, the marker sequence is a hexa-histidine peptide, as 
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provided in the pQE vector (Qiagen, Inc.) and described in Gentz etaL, Proc Natl Acad Sci 
USA (1989) 86:821-824, or is an HA tag. The polynucleotide may also contain non- 
coding 5' and 3' sequences, such as transcribed, non-translated sequences, splicing and 
polyadenylation signals, ribosome binding sites and sequences that stabilize mRNA. 
5 Further embodiments of the present invention include polynucleotides encoding 

polypeptide variants which comprise the amino acid sequence of SEQ ID NO: 2 and in 
which several, for instance from 5 to 10, 1 to 5, 1 to 3, 1 to 2 or 1, amino acid residues 
are substituted, deleted or added, in any combination. 

Polynucleotides which are identical or sufficiently identical to a nucleotide 

10 sequence contained in SEQ ID NO:l, may be used as hybridization probes for cDNA and 
genomic DNA or as primers for a nucleic acid amplification (PCR) reaction, to isolate full- 
length cDNAs and genomic clones encoding polypeptides of the present invention and to 
isolate cDNA and genomic clones of other genes (including genes encoding homologs and 
orthologs from species other than human) that have a high sequence similarity to SEQ ID 

15 NO:l. Typically these nucleotide sequences are 70% identical, preferably 80% identical, 
more preferably 90% identical, most preferably 95% identical to that of the referent. The 
probes or primers will generally comprise at least 15 nucleotides, preferably, at least 30 
nucleotides and may have at least 50 nucleotides. Particularly preferred probes will have 
between 30 and 50 nucleotides. 

20 A polynucleotide encoding a polypeptide of the present invention, including 

homologs and orthologs from species other than human, may be obtained by a process 
which comprises the steps of screening an appropriate library under stringent 
hybridization conditions with a labeled probe having the sequence of SEQ ID NO: 1 or a 
fragment thereof; and isolating full-length cDNA and genomic clones containing said 

25 polynucleotide sequence. Such hybridization techniques are well known to the skilled 

artisan. Preferred stringent hybridization conditions include overnight incubation at 42°C 
in a solution comprising: 50% formamide, 5xSSC (150mM NaCI, 15mM trisodium citrate), 
50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 10 % dextran sulfate, and 20 
microgram/ml denatured, sheared salmon sperm DNA; followed by washing the filters in 

30 O.lx SSC at about 65°C. Thus the present invention also includes polynucleotides 
obtainable by screening an appropriate library under sting e n t strinaent hybridization 
conditions with a labeled probe having the sequence of SEQ ID NO:l or a fragment 
thereof. 

The skilled artisan will appreciate that, in many cases, an isolated cDNA 
35 sequence will be incomplete, in that the region coding for the polypeptide is cut short at 
the 5' end of the cDNA. This is a consequence of reverse transcriptase, an enzyme with 
inherently low 'processivity' (a measure of the ability of the enzyme to remain attached 
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to the template during the polymerisation reaction), failing to complete a DNA copy of 
the mRNA template during 1st strand cDNA synthesis. 

There are several methods available and well known to those skilled in the art to 
obtain full-length cDNAs, or extend short cDNAs, for example those based on the 
5 method of Rapid Amplification of cDNA ends (RACE) (see, for example, Frohman et al. f 
PNAS USA 85, 8998-9002, 1988). Recent modifications of the technique, exemplified 
by the Marathon™' technology (Clontech Laboratories Inc.) for example, have 
significantly simplified the search for longer cDNAs. In the Marathon™ technology, 
cDNAs have been prepared from mRNA extracted from a chosen tissue and an 'adaptor' 

10 sequence ligated onto each end. Nucleic acid amplification (PCR) is then carried out to 
amplify the 'missing 1 5' end of the cDNA using a combination of gene specific and 
adaptor specific oligonucleotide primers. The PCR reaction is then repeated using 
'nested' primers, that is, primers designed to anneal within the amplified product 
(typically an adaptor specific primer that anneals further 3 ! in the adaptor sequence 

15 and a gene specific primer that anneals further 5' in the known gene sequence). The 
products of this reaction can then be analysod analyzed by DNA sequencing and a full- 
length cDNA constructed either by joining the product directly to the existing cDNA to 
give a complete sequence, or carrying out a separate full-length PCR using the new 
sequence information for the design of the 5' primer. 

20 Recombinant polypeptides of the present invention may be prepared by processes 

well known in the art from genetically engineered host cells comprising expression 
systems. Accordingly, in a further aspect, the present invention relates to expression 
systems which comprise a polynucleotide or polynucleotides of the present invention, to 
host cells which are genetically engineered with such expression sytems systems and to 

25 the production of polypeptides of the invention by recombinant techniques. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. 

For recombinant production, host cells can be genetically engineered to 
incorporate expression systems or portions thereof for polynucleotides of the present 

30 invention. Introduction of polynucleotides into host cells can be effected by methods 
described in many standard laboratory manuals, such as Davis et al., Basic Methods in 
Molecular Biology (1986) and Sambrook et al., Molecular Cloning: A Laboratory Manual, 
2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). 
Preferred such methods include, for instance, calcium phosphate transfection, DEAE- 

35 dextran mediated transfection, transvection, microinjection, cationic lipid-mediated 
transfection, electroporation, transduction, scrape loading, ballistic introduction or 
infection. 
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Representative examples of appropriate hosts include bacterial cells, such as 
streptococci, staphylococci, E. coli, Streptomyces and Bacillus subtilis cells; fungal cells, 
such as yeast cells and Aspergillus cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 
5 and Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used, for instance, chromosomal, 
episomal and virus-derived systems, e.g., vectors derived from bacterial plasmids, from 
bacteriophage, from transposons, from yeast episomes, from insertion elements, from 
yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such 

10 as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and 

retroviruses, and vectors derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
expression systems may contain control regions that regulate as well as engender 
expression. Generally, any system or vector which is able to maintain, propagate or 

15 express a polynucleotide to produce a polypeptide in a host may be used. The 

appropriate nucleotide sequence may be inserted into an expression system by any of a 
variety of well-known and routine techniques, such as, for example, those set forth in 
Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL {supra). Appropriate 
secretion signals may be incorporated into the desired polypeptide to allow secretion of 

20 the translated protein into the lumen of the endoplasmic reticulum, the periplasmic space 
or the extracellular environment. These signals may be endogenous to the polypeptide or 
they may be heterologous signals. 

If a polypeptide of the present invention is to be expressed for use in screening 
assays, it is generally preferred that the polypeptide be produced at the surface of the 

25 cell. In this event, the cells may be harvested prior to use in the screening assay. If 
the polypeptide is secreted into the medium, the medium can be recovered in order to 
recover and purify the polypeptide. If produced intracellular^, the cells must first be 
lysed before the polypeptide is recovered. 

Polypeptides of the present invention can be recovered and purified from 

30 recombinant cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography is employed for purification. Well-- 

35 known techniques for refolding proteins may be employed to regenerate active 

conformation when the polypeptide is denatured during isolation and or purification. 
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This invention also relates to the use of polynucleotides of the present invention as 
diagnostic reagents. Detection of a mutated form of the gene charact e rised cha racteri zed 
by the polynucleotide of SEQ 10 NO:l which is associated with a dysfunction will provide 
a diagnostic tool that can add to, or define, a diagnosis of a disease, or susceptibility to a 
5 disease, which results from under-expression, over-expression or altered expression of 
the gene. Individuals carrying mutations in the gene may be detected at the DNA level 
by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from a subject's cells, such as from 
blood, urine, saliva, tissue biopsy or autopsy material. The genomic DNA may be used 

10 directly for detection or may be amplified enzymatically by using PCR or other 

amplification techniques prior to analysis. RNA or cDNA may also be used in similar 
fashion. Deletions and insertions can be detected by a change in size of the amplified 
product in comparison to the normal genotype. Point mutations can be identified by 
hybridizing amplified DNA to labeled Human Mindin nucleotide sequences. Perfectly 

15 matched sequences can be distinguished from mismatched duplexes by RNase digestion 
or by differences in melting temperatures. DNA sequence differences may also be 
detected by alterations in electrophoretic mobility of DNA fragments in gels, with or 
without denaturing agents, or by direct DNA sequencing (ee, e.g., Myers etal. t Science 
(1985) 230:1242). Sequence changes at specific locations may also be revealed by 

20 nuclease protection assays, such as RNase and SI protection or the chemical cleavage 
method (see Cotton etal., Proc Natl Acad Sci USA (1985) 85: 4397-4401). In another 
embodiment, an array of oligonucleotides probes comprising Human Mindin nucleotide 
sequence or fragments thereof can be constructed to conduct efficient screening of e.g., 
genetic mutations. Array technology methods are well known and have general 

25 applicability and can be used to address a variety of questions in molecular genetics 
including gene expression, genetic linkage, and genetic variability (see for example: 
M.Chee et al., Science, Vol 274, pp 610-613 (1996)). 

The diagnostic assays offer a process for diagnosing or determining a susceptibility 
to the Diseases through detection of mutation in the Human Mindin gene by the methods 

30 described. In addition, such diseases may be diagnosed by methods comprising 
determining from a sample derived from a subject an abnormally decreased or 
increased level of polypeptide or mRNA. Decreased or increased expression can be 
measured at. the RNA level using any of the methods well known in the art for the 
quantitation of polynucleotides, such as, for example, nucleic acid amplification, for 

35 instance PCR, RT-PCR, RNase protection, Northern blotting and other hybridization 
methods. Assay techniques that can be used to determine levels of a protein, such as a 
polypeptide of the present invention, in a sample derived from a host are well-known to 
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those of skill in the art. Such assay methods include radioimmunoassays, competitive- 
binding assays, Western Blot analysis and ELISA assays. 

Thus in another aspect, the present invention relates to a diagono s tic d i a q n ost i c 
kit which comprises: 

5 (a) a polynucleotide of the present invention, preferably the nucleotide sequence of 
SEQ ID NO: 1, or a fragment thereof ; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID NO:2 
or a fragment thereof; or 

10 (d) an antibody to a polypeptide of the present invention, preferably to the polypeptide 
of SEQ ID NO:2. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a 
substantial component. Such a kit will be of use in diagnosing a disease or 
ousp e ctQbility susceptibility to a disease, particularly angiogenic diseases (cancer, cancer 

15 metastasis, chronic inflammatory disorders, rheumatoid arthritis, atherosclerosis, macular 
degeneration, diabetic retinopathy), restenosis, Alzheimer's disease, neural disorders and 
tissue r e mod e lin g re modelling , amongst others. 

The nucleotide sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to, and can hybridize 

20 with, a particular location on an individual human chromosome. The mapping of relevant 
sequences to chromosomes according to the present invention is an important first step 
in correlating those sequences with gene associated disease. Once a sequence has been 
mapped to a precise chromosomal location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. Such data are found in, for 

25 example, V. McKusick, Mendelian Inheritance in Man (available on-line through Johns 

Hopkins University Welch Medical Library). The relationship between genes and diseases 
that have been mapped to the same chromosomal region are then identified through 
linkage analysis (coinheritance of physically adjacent genes). 

The differences in the cDNA or genomic sequence between affected and 

30 unaffected individuals can also be determined. If a mutation is observed in some or all 
of the affected individuals but not in any normal individuals, then the mutation is likely 
to be the causative agent of the disease. 

*Th e g e n e of th e pres e nt inv e ntion map s to human chromosom e *Chromosom e Location. 

The polypeptides of the invention or their fragments or analogs thereof, or cells 
35 expressing them, can also be used as immunogens to produce antibodies immunospecific 
for polypeptides of the present invention. The term "immunospecific" means that the 
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antibodies have substantially greater affinity for the polypeptides of the invention than 
their affinity for other related polypeptides in the prior art. 

Antibodies generated against polypeptides of the present invention may be 
obtained by administering the polypeptides or epitope-bearing fragments, analogs or cells 
5 to an animal, preferably a non-human animal, using routine protocols. For preparation of 
monoclonal antibodies, any technique which provides antibodies produced by continuous 
cell line cultures can be used. Examples include the hybridoma technique (Kohler, G. and 
Milstein, C, Nature (1975) 256:495-497), the trioma technique, the human B-cell 
hybridoma technique (Kozbor etal., Immunology Today (1983) 4:72) and the EBV- 

10 hybridoma technique (Cole et al. t MONOCLONAL ANTIBODIES AND CANCER THERAPY, pp. 
77-96, Alan R. Liss, Inc., 1985). 

Techniques for the production of single chain antibodies, such as those described 
in U.S. Patent No. 4,946,778, can also be adapted to produce single chain antibodies to 
polypeptides of this invention. Also, transgenic mice, or other organisms, including other 

15 mammals, may be used to express humanized antibodies. 

The above-described antibodies may be employed to isolate or to identify clones 
expressing the polypeptide or to purify the polypeptides by affinity chromatography. 

Antibodies against polypeptides of the present invention may also be employed to 
treat the Diseases, amongst others. 

20 In a further aspect, the present invention relates to genetically engineered 

soluble fusion proteins comprising a polypeptide of the present invention, or a fragment 
thereof, and various portions of the constant regions of heavy or light chains of 
immunoglobulins of various subclasses (IgG, IgM, IgA, IgE). Preferred as an 
immunoglobulin is the constant part of the heavy chain of human IgG, particularly 

25 IgGl, where fusion takes place at the hinge region. In a particular embodiment, the Fc 
part can be removed simply by incorporation of a cleavage sequence which can be 
cleaved with blood clotting factor Xa. Furthermore, this invention relates to processes 
for the preparation of these fusion proteins by genetic engineering, and to the use 
thereof for drug screening, diagnosis and therapy. A further aspect of the invention 

30 also relates to polynucleotides encoding such fusion proteins. Examples of fusion 

protein technology can be found in International Patent Application Nos. W094/29458 
and W094/22914. 

Another aspect of the invention relates to a method for inducing an 
immunological response in a mammal which comprises inoculating the mammal with a 
35 polypeptide of the present invention, adequate to produce antibody and/or T cell 
immune response to protect said animal from the Diseases hereinbefore mentioned, 
amongst others. Yet another aspect of the invention relates to a method of inducing 
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immunological response in a mammal which comprises, delivering a polypeptide of the 
present invention via a vector directing expression of the polynucleotide and coding for 
the polypeptide in vivo in order to induce such an immunological response to produce 
antibody to protect said animal from diseases. 
5 A further aspect of the invention relates to an immunological/vaccine 

formulation (composition) which, when introduced into a mammalian host, induces an 
immunological response in that mammal to a polypeptide of the present invention 
wherein the composition comprises a polypeptide or polynucleotide of the present 
invention. The vaccine formulation may further comprise a suitable carrier Since a 

10 polypeptide may be broken down in the stomach, it is preferably administered 

parenterally (for instance, subcutaneous, intramuscular, intravenous, or intradermal 
injection). Formulations suitable for parenteral administration include aqueous and 
non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, 
bacteriostats and solutes which render the formulation fflstem eisotonic with the blood of 

15 the recipient; and aqueous and non-aqueous sterile suspensions which may include 
suspending agents or thickening agents. The formulations may be presented in unit- 
dose or multi-dose containers, for example, sealed ampoules and vials and may be 
stored in a freeze-dried condition requiring only the addition of the sterile liquid carrier 
immediately prior to use. The vaccine formulation may also include adjuvant systems 

20 for enhancing the immunogenicity of the formulation, such as oil-in water systems and 
other systems known in the art. The dosage will depend on the specific activity of the 
vaccine and can be readily determined by routine experimentation. 

Polypeptides of the present invention are responsible for many biological 
functions, including many disease states, in particular the Diseases hereinbefore 

25 mentioned. It is therefore desirous to devise screening methods to identify compounds 
which stimulate or which inhibit the function of the polypeptide. Accordingly, in a further 
aspect, the present invention provides for a method of screening compounds to identify 
those which stimulate or which inhibit the function of the polypeptide. In general, 
agonists or antagonists may be employed for therapeutic and prophylactic purposes for 

30 such Diseases as hereinbefore mentioned. Compounds may be identified from a variety 
of sources, for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. Such agonists, antagonists or inhibitors so-identified may be natural or 
modified substrates, ligands, receptors, enzymes, etc., as the case may be, of the 
polypeptide; or may be structural or functional mimetics thereof (see Coligan eta/., 

35 Current Protocols in Immunology l(2):Chapter 5 (1991)). 

The screening method may simply measure the binding of a candidate 
compound to the polypeptide, or to cells or membranes bearing the polypeptide, or a 
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fusion protein thereof by means of a label directly or indirectly associated with the 
candidate compound. Alternatively, the screening method may involve competition 
with a labeled competitor. Further, these screening methods may test whether the 
candidate compound results in a signal generated by activation or inhibition of the 
5 polypeptide, using detection systems appropriate to the cells bearing the polypeptide. 
Inhibitors of activation are generally assayed in the presence of a known agonist and 
the effect on activation by the agonist by the presence of the candidate compound is 
observed. Constitutively active polpypeptides may be employed in screening methods 
for inverse agonists or inhibitors, in the absence of an agonist or inhibitor, by testing 

10 whether the candidate compound results in inhibition of activation of the polypeptide. 
Further, the screening methods may simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide of the present invention, to form a 
mixture, measuring Human Mindin activity in the mixture, and comparing the Human 
Mindin activity of the mixture to a standard. Fusion proteins, such as those made from 

15 Fc portion and Human Mindin polypeptide, as hereinbefore described, can also be used 
for high-throughput screening assays to identify antagonists for the polypeptide of the 
present invention (see D. Bennett et ai. f J Mol Recognition, 8:52-58 (1995); and K. 
Johanson eta/., J Biol Chem, 270(16):9459-9471 (1995)). 
*Scr ee ningT e chniqu e 

20 

The polynucleotides, polypeptides and antibodies to the polypeptide of the 
present invention may also be used to configure screening methods for detecting the 
effect of added compounds on the production of mRNA and polypeptide in cells. For 
example, an ELISA assay may be constructed for measuring secreted or cell associated 

25 levels of polypeptide using monoclonal and polyclonal antibodies by standard methods 
known in the art. This can be used to discover agents which may inhibit or enhance 
the production of polypeptide (also called antagonist or agonist, respectively) from 
suitably manipulated cells or tissues. 

The polypeptide may be used to identify membrane bound or soluble receptors, 

30 if any, through standard receptor binding techniques known in the art. These include, 
but are not limited to, ligand binding and crosslinking assays in which the polypeptide is 
labeled with a radioactive isotope (for instance, 12 ^I), chemically modified (for 
instance, biotinylated), or fused to a peptide sequence suitable for detection or 
purification, and incubated with a source of the putative receptor (cells, cell 

35 membranes, cell supernatants, tissue extracts, bodily fluids). Other methods include 
biophysical techniques such as surface plasmon resonance and spectroscopy. These 
screening methods may also be used to identify agonists and antagonists of the 
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polypeptide which compete with the binding of the polypeptide to its receptors, if any. 
Standard methods for conducting such assays are well understood in the art, 

Examples of potential polypeptide antagonists include antibodies or, in some 
cases, oligonucleotides or proteins which are closely related to the ligands, substrates, 
5 receptors, enzymes, etc., as the case may be, of the polypeptide, e.g., a fragment of the 
ligands, substrates, receptors, enzymes, etc.; or small molecules which bind to the 
polypeptide of the present invention but do not elicit a response, so that the activity of 
the polypeptide is prevented. % 

Thus, in another aspect, the present invention relates to a screening kit for 
10 identifying agonists, antagonists, ligands, receptors, substrates, enzymes, etc. for 
polypeptides of the present invention; or compounds which decrease or enhance the 
production of such polypeptides, which comprises: 

(a) a polypeptide of the present invention; 

(b) a recombinant cell expressing a polypeptide of the present invention; 
15 (c) a cell membrane expressing a polypeptide of the present invention; or 

(d) antibody to a polypeptide of the present invention; 
which polypeptide is preferably that of SEQ ID NO: 2. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a 
substantial component. 
20 It will be readily appreciated by the skilled artisan that a polypeptide of the 

present invention may also be used in a method for the structure-based design of an 
agonist, antagonist or inhibitor of the polypeptide, by: 
(a) determining in the first instance the three-dimensional structure of the 
polypeptide; 

25 (b) deducing the three-dimensional structure for the likely reactive or binding site(s) of 
an agonist, antagonist or inhibitor; 

(c) synthe s in g svnthesizina candidate compounds that are predicted to bind to or react 
with the deduced binding or reactive site; and 

(d) testing whether the candidate compounds are indeed agonists, antagonists or 
30 inhibitors. 

It will be further appreciated that this will normally be an interativ e iterative process. 

In a further aspect, the present invention provides methods of treating abnormal 
conditions such as, for instance, angiogenic diseases (cancer, cancer metastasis, chronic 
inflammatory disorders, rheumatoid arthritis, atherosclerosis, macular degeneration, 
35 diabetic retinopathy), restenosis, Alzheimer's disease, neural disorders and tissue 
remodeling, related to either an excess of, or an under-expression of, Human Mindin 
polypeptide activity. 
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If the activity of the polypeptide is in excess, several approaches are available. 
One approach comprises administering to a subject in need thereof an inhibitor 
compound (antagonist) as hereinabove described, optionally in combination with a 
pharmaceutical^ acceptable carrier, in an amount effective to inhibit the function of the 
5 polypeptide, such as, for example, by blocking the binding of ligands, substrates, 
receptors, enzymes, etc., or by inhibiting a second signal, and thereby alleviating the 
abnormal condition. In another approach, soluble forms of the polypeptides still capable 
of binding the ligand, substrate, enzymes, receptors, etc. in competition with 
endogenous polypeptide may be administered. Typical examples of such competitors 

10 include fragments of the Human Mindin polypeptide. 

In still another approach, expression of the gene encoding endogenous Human 
Mindin polypeptide can be inhibited using expression blocking techniques. Known such 
techniques involve the use of antisense sequences, either internally generated or 
separately administered (see, for example, O'Connor, J Neurochem (1991) 56:560 in 

15 Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Alternatively, oligonucleotides which form triple helices with the 
gene can be supplied (see, for example, Lee et al. f Nucleic Acids Res (1979) 6:3073; 
Cooney etal., Science (1988) 241:456; Dervan etat., Science (1991) 251:1360). 
These oligomers can be administered perse or the relevant oligomers can be expressed 

20 in vivo. 

For treating abnormal conditions related to an under-expression of Human Mindin 
and its activity, several approaches are also available. One approach comprises 
administering to a subject a therapeutically effective amount of a compound which 
activates a polypeptide of the present invention, i.e., an agonist as described above, in 

25 combination with a pharmaceutically acceptable carrier, to thereby alleviate the abnormal 
condition. Alternatively, gene therapy may be employed to effect the endogenous 
production of Human Mindin by the relevant cells in the subject. For example, a 
polynucleotide of the invention may be engineered for expression in a replication 
defective retroviral vector, as discussed above. The retroviral expression construct may 

30 then be isolated and introduced into a packaging cell transduced with a retroviral plasmid 
vector containing RNA encoding a polypeptide of the present invention such that the 
packaging cell now produces infectious viral particles containing the gene of interest. 
These producer cells may be administered to a subject for engineering cells in vivo and 
expression of the polypeptide in vivo. For an overview of gene therapy, see Chapter 20, 

35 Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, (and 

references cited therein) in Human Molecular Genetics, T Strachan and A P Read, BIOS 
Scientific Publishers Ltd (1996). Another approach is to administer a therapeutic amount 
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of a polypeptide of the present invention in combination with a suitable pharmaceutical 
carrier. 

In a further aspect, the present invention provides for pharmaceutical 
compositions comprising a therapeutically effective amount of a polypeptide, such as the 
5 soluble form of a polypeptide of the present invention, agonist/antagonist peptide or 
small molecule compound, in combination with a pharmaceutical^ acceptable carrier or 
excipient. Such carriers include, but are not limited to, saline, buffered saline, dextrose, 
water, glycerol, ethanol, and combinations thereof. The invention further relates to 
pharmaceutical packs and kits comprising one or more containers filled with one or more 

10 of the ingredients of the aforementioned compositions of the invention. Polypeptides and 
other compounds of the present invention may be employed alone or in conjunction with 
other compounds, such as therapeutic compounds. 

The composition will be adapted to the route of administration, for instance by a 
systemic or an oral route. Preferred forms of systemic administration include injection, 

15 typically by intravenous injection. Other injection routes, such as subcutaneous, 
intramuscular, or intraperitoneal, can be used. Alternative means for systemic 
administration include transmucosal and transdermal administration using penetrants 
such as bile salts or fusidic acids or other detergents. In addition, if a polypeptide or 
other compounds of the present invention can be formulated in an enteric or an 

20 encapsulated formulation, oral administration may also be possible. Administration of 
these compounds may also be topical and/or localized, in the form of salves, pastes, gels, 
and the like. 

The dosage range required depends on the choice of peptide or other compounds 
of the present invention, the route of administration, the nature of the formulation, the 

25 nature of the subject's condition, and the judgment of the attending practitioner. 

Suitable dosages, however, are in the range of 0.1-100 M9/kg of subject. Wide variations 
in the needed dosage, however, are to be expected in view of the variety of compounds 
available and the differing efficiencies of various routes of administration. For example, 
oral administration would be expected to require higher dosages than administration by 

30 intravenous injection. Variations in these dosage levels can be adjusted using standard 
empirical routines for optimization, as is well understood in the art. 

Polypeptides used in treatment can also be generated endogenously in the 
subject, in treatment modalities often referred to as "gene therapy" as described above. 
Thus, for example, cells from a subject may be engineered with a polynucleotide, such as 

35 a DNA or RNA, to encode a polypeptide ex vivo, and for example, by the use of a 
retroviral plasmid vector. The cells are then introduced into the subject. 
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Polynucleotide and polypeptide sequences form a valuable information resource 
with which to identify further sequences of similar homology. This is most easily 
facilitated by storing the sequence in a computer readable medium and then using the 
stored data to search a sequence database using well known searching tools, such as 
5 GCC. Accordingly, in a further aspect, the present invention provides for a computer 
readable medium having stored thereon a polynucleotide comprising the sequence of 
SEQ ID NO:l and/or a polypeptide sequence encoded thereby. 

The following definitions are provided to facilitate understanding of certain terms 
used frequently hereinbefore. 

10 "Antibodies" as used herein includes polyclonal and monoclonal antibodies, 

chimeric, single chain, and humanized antibodies, as well as Fab fragments, including 
the products of an Fab or other immunoglobulin expression library. 

"Isolated" means altered "by the hand of man" from the natural state. If an 
"isolated" composition or substance occurs in nature, it has been changed or removed 

15 from its original environment, or both. For example, a polynucleotide or a polypeptide 
naturally present in a living animal is not "isolated," but the same polynucleotide or 
polypeptide separated from the coexisting materials of its natural state is "isolated", as 
the term is employed herein. 

"Polynucleotide" generally refers to any polyribonucleotide or 

20 polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
"Polynucleotides" include, without limitation, single- and double-stranded DNA, DNA 
that is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules 
comprising DNA and RNA that may be single-stranded or, more typically, double- 

25 stranded or a mixture of single- and double-stranded regions. In addition, 

"polynucleotide" refers to triple-stranded regions comprising RNA or DNA or both RNA 
and DNA. The term "polynucleotide" also includes DNAs or RNAs containing one or 
more modified bases and DNAs or RNAs with backbones modified for stability or for 
other reasons. "Modified" bases include, for example, tritylated bases and unusual 

30 bases such as inosine. A variety of modifications may be made to DNA and RNA; thus, 
"polynucleotide" embraces chemically, enzymatically or metabolically modified forms of 
polynucleotides as typically found in nature, as well as the chemical forms of DNA and 
RNA characteristic of viruses and cells. "Polynucleotide" also embraces relatively short 
polynucleotides, often referred to as oligonucleotides. 

35 "Polypeptide" refers to any peptide or protein comprising two or more amino 

acids joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres. "Polypeptide" refers to both short chains, commonly referred to as peptides, 
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oligopeptides or oligomers, and to longer chains, generally referred to as proteins. 
Polypeptides may contain amino acids other than the 20 gene-encoded amino acids. 
"Polypeptides" include amino acid sequences modified either by natural processes, such 
as post-translational processing, or by chemical modification techniques which are well 
5 known in the art. Such modifications are well-described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications may 
occur anywhere in a polypeptide, including the peptide backbone, the amino acid side- 
chains and the amino or carboxyl termini. It will be appreciated that the same type of 
modification may be present to the same or varying degrees at several sites in a given 

10 polypeptide. Also, a given polypeptide may contain many types of modifications. 
Polypeptides may be branched as a result of ubiquitination, and they may be cyclic, 
with or without branching. Cyclic, branched and branched cyclic polypeptides may 
result from post-translation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 

15 attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cystine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 

20 anchor formation, hydroxyiation, iodination, methylation, myristoylation, oxidation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, 
sulfation, transfer-RNA mediated addition of amino acids to proteins such as 
arginylation, and ubiquitination (see, for instance, PROTEINS - STRUCTURE AND 
MOLECULAR PROPERTIES, 2nd Ed., T, E. Creighton, W. H. Freeman and Company, 

25 New York, 1993; Wold, F., Post-translational Protein Modifications: Perspectives and 
Prospects, pgs. 1-12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF 
PROTEINS, B. C. Johnson, Ed., Academic Press, New York, 1983; Seifter etal., 
"Analysis for protein modifications and nonprotein cofactors", Meth Enzymol (1990) 
182:626-646 and Rattan et al., "Protein Synthesis: Post-translational Modifications and 

30 Aging", Ann NY Acad Sci (1992) 663:48-62). 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 

35 sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide 
changes may result in amino acid substitutions, additions, deletions, fusions and 
truncations in the polypeptide encoded by the reference sequence, as discussed below. 



DDX-100US PATENT 



A typical variant of a polypeptide differs in amino acid sequence from another, 
reference polypeptide. Generally, differences are limited so that the sequences of the 
reference polypeptide and the variant are closely similar overall and, in many regions, 
identical. A variant and reference polypeptide may differ in amino acid sequence by 
5 one or more substitutions, additions, deletions in any combination. A substituted or 
inserted amino acid residue may or may not be one encoded by the genetic code. A 
variant of a polynucleotide or polypeptide may be a naturally occurring such as an 
allelic variant, or it may be a variant that is not known to occur naturally. Non- 
naturally occurring variants of polynucleotides and polypeptides may be made by 

10 mutagenesis techniques or by direct synthesis. 

"Identity," as known in the art, is a relationship between two or more polypeptide 
sequences or two or more polynucleotide sequences, as the case may be, as determined by 
comparing the sequences. In the art, "identity" also means the degree of sequence 
relatedness between polypeptide or polynucleotide sequences, as the case may be, as 

15 determined by the match between strings of such sequences. "Identity" can be readily 
calculated by known methods, including but not limited to those described in 
(Computational Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 
1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic 
Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A.M., and 

20 Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular 
Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, 
M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and 
Lipman, D., SIAM J. Applied Math., 48: 1073 (1988). Methods to determine identity are 
designed to give the largest match between the sequences tested. Moreover, methods to 

25 determine identity are codified in publicly available computer programs. Computer 
program methods to determine identity between two sequences include, but are not 
limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 
387 (1984)), BLASTP, BLASTN, and FASTA (Atschul, S.F. et al., J. Mo/ec. Biol. 215: 403- 
410 (1990). The BLAST X program is publicly available from NCBI and other sources 

30 {BLAST Manual, Altschul, S., etal., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et 
al., J. Mol. Biol. 215: 403-410 (1990). The well-known Smith Waterman algorithm may 
also be used to determine identity. 

Parameters for polypeptide sequence comparison include the following: 
1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

35 Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl. Acad. Sci. 
USA. 89:10915-10919 (1992) 
Gap Penalty: 12 
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Gap Length Penalty: 4 

A program useful with these parameters Is publicly available as the "gap" program from 
Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for peptide comparisons (along with no penalty for end gaps). 
5 Parameters for polynucleotide comparison include the following: 

1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

10 Available as: The "gap" program from Genetics Computer Group, Madison WI. These are 
the default parameters for nucleic acid comparisons. 

A preferred meaning for "identity" for polynucleotides and polypeptides, as the 
case may be, are provided in (1) and (2) below. 

(1) Polynucleotide embodiments further include an isolated polynucleotide 

15 comprising a polynucleotide sequence having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 
100% identity to the reference sequence of SEQ ID NO:l, wherein said polynucleotide 
sequence may be identical to the reference sequence of SEQ ID NO: 1 or may include up 
to a certain integer number of nucleotide alterations as compared to the reference 
sequence, wherein said alterations are selected from the group consisting of at least one 

20 nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3* terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 
individually among the nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence, and wherein said number of nucleotide 

25 alterations is determined by multiplying the total number of nucleotides in SEQ ID IMO:l 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleotides in SEQ ID NO:l, or: 

n n <; x n - (x n • y), 

30 

wherein n n is the number of nucleotide alterations, x n is the total number of nucleotides 
in SEQ ID NO:!, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol 
for the multiplication operator, and wherein any non-integer product of x n and y is 
35 rounded down to the nearest integer prior to subtracting it from x n . Alterations of a 
polynucleotide sequence encoding the polypeptide of SEQ ID NO:2 may create nonsense, 
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missense or frameshift mutations in this coding sequence and thereby alter the 
polypeptide encoded by the polynucleotide following such alterations. 

By way of example, a polynucleotide sequence of the present invention may be 
identical to the reference sequence of SEQ ID NO: 2, that is it may be 100% identical, or 

5 it may include up to a certain integer number of amino acid alterations as compared to 
the reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one nucleic acid deletion, 
substitution, including transition and transversion, or insertion, and wherein said 
alterations may occur at the 5' or 3' terminal positions of the reference polynucleotide 

10 sequence or anywhere between those terminal positions, interspersed either individually 
among the nucleic acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. The number of nucleic acid alterations for a given percent 
identity is determined by multiplying the total number of amino acids in SEQ ID NO:2 by 
the integer defining the percent identity divided by 100 and then subtracting that product 

15 from said total number of amino acids in SEQ ID NO:2, or: 

n n < x n - (x n • y), 

wherein n n is the number of amino acid alterations, x n is the total number of amino 
acids in SEQ ID IMO:2, y is, for instance 0.70 for 70%, 0,80 for 80%, 0.85 for 85% etc., • 
is the symbol for the multiplication operator, and wherein any non-integer product of x n 
and y is rounded down to the nearest integer prior to subtracting it from x n . 

(2) Polypeptide embodiments further include an isolated polypeptide 
comprising a polypeptide having at least a 50,60, 70, 80, 85, 90, 95, 97 or 100% 
identity to a polypeptide reference sequence of SEQ ID IMO:2, wherein said polypeptide 
sequence may be identical to the reference sequence of SEQ ID NO: 2 or may include up 
to a certain integer number of amino acid alterations as compared to the reference 
sequence, wherein said alterations are selected from the group consisting of at least one 
amino acid deletion, substitution, including conservative and non-conservative 
substitution, or insertion, and wherein said alterations may occur at the amino- or 
carboxy-terminal positions of the reference polypeptide sequence or anywhere between 
those terminal positions, interspersed either individually among the amino acids in the 
reference sequence or in one or more contiguous groups within the reference sequence, 
and wherein said number of amino acid alterations is determined by multiplying the total 
number of amino acids in SEQ ID NO:2 by the integer defining the percent identity 
divided by 100 and then subtracting that product from said total number of amino acids 
in SEQ ID NO:2, or: 
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n a x a - (x a • y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino 
5 acids in SEQ ID NO:2, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 
0.85 for 85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the 
symbol for the multiplication operator, and wherein any non-integer product of x a and y 
is rounded down to the nearest integer prior to subtracting it from x a . 

By way of example, a polypeptide sequence of the present invention may be 

10 identical to the reference sequence of SEQ ID NO:2, that is it may be 100% identical, or 
it may include up to a certain integer number of amino acid alterations as compared to 
the reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 

15 wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in 
one or more contiguous groups within the reference sequence. The number of amino acid 
alterations for a given % identity is determined by multiplying the total number of amino 

20 acids in SEQ ID NO: 2 by the integer defining the percent identity divided by 100 and 

then subtracting that product from said total number of amino acids in SEQ ID NO: 2, or: 

"a * *a - (*a # V)' 

25 wherein n a is the number of amino acid alterations, x a is the total number of amino 
acids in SEQ ID NO: 2, y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., 
and • is the symbol for the multiplication operator, and wherein any non-integer product 
of x a and y is rounded down to the nearest integer prior to subtracting it from x a . 

30 "Fusion protein" refers to a protein encoded by two, often unrelated, fused 

genes or fragments thereof. In one example, EP-A-0 464 discloses fusion proteins 
comprising various portions of constant region of immunoglobulin molecules together 
with another human protein or part thereof. In many cases, employing an 
immunoglobulin Fc region as a part of a fusion protein is advantageous for use in 

35 therapy and diagnosis resulting in, for example, improved pharmacokinetic properties 
[see, e.g., EP-A 0232 262]. On the other hand, for some uses it would be desirable to 
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be able to delete the Fc part after the fusion protein has been expressed, detected and 
purified. 

All publications, including but not limited to patents and patent applications, 
cited in this specification are herein incorporated by reference as if each individual 
5 publication were specifically and individually indicated to be incorporated by reference 
herein as though fully set forth. 
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ABSTRACT 

Human Mindin polypeptides and polynucleotides and methods for 
producing such polypeptides by recombinant techniques are disclosed. Also disclosed 
are methods for utilizing Human Mindin polypeptides and polynucleotides in therapy, 
5 and diagnostic assays for such. 
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SEQUENCE INFORMATION 
SEQ ID NO: 

1 ATCCAAAACCCCACCCCCCCCCCCCCCCTGCCCAACCCCCTCTCCCCTCTCCTCCTCCCCACTCTCCCCCCCGC 

2 CCCCCACCCTCTTCCCCCAGACTCCATCTGTTCCCCCACACCCCTCCCCAAATACAGCATCACCTTCACCCCCA 

3 ATTGGACCCAGACCGCCTTCCCCAAGCAGTACCCCCTGTTCCGCCCCCCTGCGCAGTGCTCTTCGCTGCTGGGG 

4 CCCCCCCATACCTCCCACTACACCATGTCGACCAACAACCACTACCTCACTAACCCCCTCCCCCACTTTCCGCA 

5 ACCCGGCGAGGCCTGCGCCCTGATTAACGAGATCGAGGCGGCGGGGGAGGCGCTGCAGAGCGTGCACCCGCTGT 

6 TTTCCG C G CCCGCCCTCCCCAGCC G CA CCCGG CAGA C G T C GGCGCAG CTGGAGCTCCAGCGCACGCACTCGCTG 

7 GTCTCGTTTGTGGTGCCCATCGTGCCCACCCCCCACTGGTTCGTGGGCGTCGACAGCCTGGACCTGTGCGACGG 

8 CCACCGTTCCCCCCAACACCCCCCCCTCCACCTGTACCCCTACGACCCCCCGACCCACACCCCCTTCACCTTCT 

9 CCTCCCCCAACTTCCCCACCATCCCGCAGGACACGGTCACCGAGATAACGTCCTCCTCTCCCAGCCACCCGGCC 
10 AACTCCTTCTACTACCCGCGCCTGAAGGCCCTCCCTCCCATCGCCAGGGTGACACTGGTGCGGCTGCGACAGAG 

12 CACAAACCCCCCTCCACTCCCACCTCTCCCTGTCCTCCTCCTCCCCACTCTCCCCACCCCACTCTCCCACCCTC 

13 GCGACCAAGACCAGGACTCGCTACGTCCCGGTCCAGCCCGCCAACAACCGGAGCCCCTGCCCCGAGCTCGAAGA 

14 AGAGGCTGAGTGCGTCCCTCATAACTGCGTCTAA 

15 

SEQ ID NO:2 

MBNP S PAAALCKALCALLLATLCAACQPLCCE S IC S ARALAKYSITFTGNWSQTAFPKQYPLFRrPAQWCSLLG 

Aj\H 6 SDYSMWnKNQYVSNGLnDF7VEnGEAWALIKEIEAAGEALQ6VHAVFCAPAVPSCTCQTSAELEVQRRHSL 

20 V 6 FWRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYPYDAGTD SG FTF 6S PNFATIPQPTVTEIT66 S PSHPA 

N S FYYPRLKALPPIARVTLVRLRQSPRAFIPPArVLPSRDNEIVDSASVPETPLDCEVSLWSSWGLCGGHCGRL 
CTKSRTRYVRVQPANNCGPCPELEEEAECVPDNCV* 

SEQ ID NOS:3 and 4 

CGCACgAGCGAAgACAGGAGGAACTGGAGCCTCATTGGCCGGCCCGGGGCCCCGGCCTCG 6-0- 

25 GGCTTAAATACCAgCTCCGCCCTCTGGCTGGGACCCgACCGCTGCCCCCCGCGCTCCCGC 3r30 

TCCTCCTCCCCCCTC ATC CAAAACCCCACCCCCCCCCCCCCCCTCCCCAACCCCCTCTGC 3r&0 

M E N P 6 P A — A — A h G K — A — L G 

GCTCTCCTCCTCGCCACTCTCGGCGCCGCCCGCCAGCCTCTTGGCGGAgAgTCCATCTGT 2-40 

30 A — L — L L — A T h G A — A — G Q P h G G B S 5 G 

TCCCCCAgAgCCCTCCCCAAATACAGCATCACCTTCACCCCCAATTCCACCCAgACCCCC 3-00 

£ — A — R — A — h A — K — ¥ S 1 F — ¥ — G N — W S Q T — A 

35 TTCCCCAAGCAGTACCCCCTGTTCCGCCCCCCTCCGCAgTGGTCTTCCCTGCTGCGGCCC 

F P K — Q ¥ P h F R P P A — @ W S S h L G A 

G CgCATAg €TC CgACTACACCA TOTGG A CC A A gAA CC A CT A €g TCAgTAACGGGCTCCGC 4^0 

A — H S S S ¥ S M W R K N Q ¥ V S N G L R 



gAcTTTCCCCAACCCCCCgAgCCCTCCCCGCTCATTAAgCAgATC ±&$ 

& F — A — E ft — G S A — W — A — h £ K — B £ 

5 - — SOObp gap in aoqucnco 

TAACACCAGACCCCcCCACCCCoTCCCCCCCCCCCCACCCATCGCCTCTCCCCCCCTCCT 

■* 9 Q S P — A — A P G A P ft — S H G V 6 G A P 

10 GTGCAGGcTCATCGTGCAGGCCgCCGAGCGCACAGGGGGTTTCGCGCTGoTCCTGACCGC 124 

V — @ — A — H — A — A — G G ft G H ft — G F ft — A — A P 9 ft 

GGTGACCCCGCGCCCACCATgTgTCCAATGAACCGCCCTCTCGTGCCCCCCACGGGCATT 3r&S 

G B — A — A P T I S A — M K — G P h — V — A— G T — G £■ 

15 

C CCAAACA C CCTG €g€€¥gTC€ CAACCTTG €^ 2-4-0 

G K — Q P P P F P T L h h ft G P ft V P S A 

gTCAGCCTCCTCCTCCTGCACGATAAACTCATCCCCAAGGCTCCAGCTACTgTAAATTAT 3^0 

20 L S h h L h Q B K — V £ P K — A P A — 3? L N ¥ 

GTCTCCTTATAAGTTATTCCTGCTCCACGAGATTGTCCTTCATcGTCCAGCGGCCTGGCT 5^0 

V — S h * V £ A — A P G B G P 6 S S ft — G h — A 

25 CCCACGTGGTTCCAGATACCTCAGACCTGGTGCTGTAGGCTGTGGTGAGCCCACTCTCCC 

P £ W 1 Q 1 P Q 53 W — G S ft L G * A H S P 

GACGGCGCATCCAAGCGGCCCCCACTTCAGAAGTGAATAAATGGGGCGGTTTCGGAACCC 4-&0 

B G A S K — ft G P h B K * 1 X G A — V S B A 

30 

TCAGTCTTTCCATGTTATGGATCTCTCTCCGTTTG7UVTAAAGACTATCTCTGTTGCTCAA $4rQ- 

S V F P G ¥ G S L G V * J K ? i S V A Q 



AAAAAAAAAAAAAAA 
35 K K K K K 



10 



20 



35 



SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: SMITHKLINE BEECHAM CORPORATION 



(ii) TITLE OF THE INVENTION: INTEGRIN LICAND, HUMAN MINDIN 



(iii) NUMBER OF SEQUENCES : 4 



(iv) CORRE S PONDENCE ADDRE SS : 

(A) ADDRESSEE: RATNER & PRE S TIA 

15 (B) STREET: P.O. BOX 980 

( C) CITY : VALLEY F ORGE 

(D) STATE : PA 



(E) COUNTRY : U S A 



(F) ZIP : 19482 



(-V-) — C OMPUTE R R EA D ABLE FORM: 

(A) MEDIUM TYPE : Diokctto 

(B) COMPUTER : IBM Compatible 

(C) OPERATINC SYSTEM: DOS 

25 (D) S OFTWARE : FaotSEQ for Windowa Vcroion 2.0 



(vi) CURRENT APPLICATION DATA : 



(A) APPLICATION NUMBER : TO BE AS S IGNED 
(D) FILING DATE : 07 MAY 199 8 



30 (C) CLASSIFICATION: UNKNOWN 



(vii) PRIOR APPLICATION DATA : 

(A) APPLICATION NUMBER: 60/04 6 ,106 

(B) FILING DATE : 09 MAY 1997 



(viii) ATTORNEY /AGENT INF O RMA TIO N : 
(A) NAME: PRE C TIA, PAUL F 



40 (B) REGI S TRATION NUMBER : 23,031 

(C) REFERENCE /DOCKET NUMBER : Gil 70012 

(ix) TELECOMMUNICATION INFORMATION: 



(A) TELEPHONE! 610 107 0700 

(D) TELEFAX : 6 10 407 0701 

(C) TELEX ; 8 46169 

5 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 096 baoc pairo 

10 (B) TYPE ; nucleic acid 

(C) STRANDEDNESS : oinglc 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : CDNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATCCAAAACC CCACCCCGGC CGCCGCCCTC CGCAACGCCC TCTGCGCTCT CCTCCTGGCC &0 

ACTCTCCGCG CCGCCGGCCA GCCTCTTGGG CGACAGTCCA TCTCTTCCGC CAGAGCCCTG t20 

GCCAAATACA GCATCACCTT CACCGGCAAT TGGAGCCAGA CCGCCTTCCC CAACCACTAC ^ 

20 CCCCTCTTCC GCCCCCCTGC GCAGTGGTCT TCGCTGCTGG GGGCCGCGCA TAGCTCCGAC 240 

-TAGAGGA-TG-T— GGAGGAAGAA— GGAGT-A C C T C A GT AA CGG G C TG C CCGAG- T T TGCGGAAC GC 

GCCCAGGCCT GGGCGCTGAT TAAGGAGATC GAGGCGGCGG GCGAGGCGCT GCAGACCGTG 

CACCCGCTGT TTTCGGCGCC CGCCGTCCCC AGCGGCACCG GGCAGACGTC GGCCGAGCTG 420 

GAGGTGCAGC GCACGCACTC GCTGGTCTCG TTTCTCGTGC GCATCCTGCC CAGCCCCCAC £-80 

25 TGCTTCCTGG GCGTCGACAG CCTGGACCTG TGCGACGGGG ACCGTTGGCC GGAACACCCG £40 

C C CCTCCAGG--^G-T-ACCCCT- A C G A CGCC GGG ACG GAGAG CG GCTT -GAGCTT CT-GCTCCCCC 6-0-0 

AACTTCCCCA CCATCCCGCA GCACACGGTC ACCGAGATAA CGTCCTCCTC TCCCAGCCAC 6£0 

CCGGCCAACT CCTTCTACTA CCC GCGCCTG AAGGCCCTGC CTCCCATCGC CACGGTGACA 7-20 

CTGGTGCCGC TGCGACACAG CCCCAGGGCC TTCATCCCTC CCGCCCCAGT CCTCCCCACC 7-80 

30 AGGGACAATG ACATTGTAGA CAGCGCCTCA CTTCCACAAA CCCCCCTGGA CTCCGAGGTC &40 

TCCCTGTGGT CGTCCTCGCG ACTGTGCGGA GGCCACTGTG GGAGG C TCGG GACCAAGAGC £00 

AGGACTCGCT ACGTCCGGCT CCAGCCCGCC AACAACCGCA CC C CCTGCCC CGAGCTCGAA 0^0 

GAACACCCTG AGTCCGTCCC TCATAACTGC GTCTAA $94- 

35 (2) INFORMATION FOR SEQ ID NO : 2 ; 

(i) SEQUENCE CHARACTERISTICS : 

£A) — LENGTH : — 331 amino aeido 

(D) TYPE : amino acid 

40 (C) STRANDEDNES S : oinglc 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 



20 









SEQUENCE DEi 




PTIQ1 


W: SEQ ID NO 


















Pro 


Scr 


Pro 


Ala 


Ala 


-Aia- 


Lcu 












-Alar 






ratj t f 




— 5— 










i r» 


-Giy- 


— ±jryo- 


-A-i a 


Leu 


-Gye- 

1 C 




Leu 


Lqu 


T r*i i 


Ala 


Thr 


Leu 


Gly Ala Ala 










— i-=r 






LI 










Cly Gin 


Pro 


-J-rCna- 


uiy 

*> r\ 


Gly Glu 


Sea?- 






Scr 


Ala 




Ala 


Leu 


Ala 






Scr 




jU 










v_ y d" 

-> c 






Arg 




-40— 




Lyo 


Tyr 




■1 1 c 

/I tr 


rnr- 


Phc 




Gly Aan 




Scr 


Cln 


Thr 


Ala 


Phc 


Pro 




Gin 








Phc 














-55— 






Lyo 




Tyr 


— inro- 


-Lcu- 




Arg 




Pro 


Ala 


Gin 




Scr 


Scr 


Leu 


Leu 







(~^> ^ 






65— 


















Gly Ala Ala 

n c 


■ H 1 9 


cJcr 


Scr 


Aop 




6or 


Met 








Aon 


Cln 




Val 


Scr 


















Trp 


Arg 


Lyo 






Tyr 


-9-0— 




Aon 


Uly 


Lou 




Aop 


f%e- 


Ala 


Glu 


Arg Gly Glu Ala 




Ala 


Leu 


-tte- 




r^ T 1 1 




-9-5 
Glu 












Trp 


-1-0-5- 






Lyo 


UlU 


lie 




-Aia 


Ala- 


Gly Glu Ala 


Leu 




Scr 


-Va±- 


Hio 


Ala 


Val 


Phc 


Scr 


-440 

Ala 




Ala 












120 














Pro 




Va±- 


Pro 


Scr Gly 


Thr Gly Cln 


Thr 


Scr 


Ala 


-Gl*- 


Leu 


-43-5 
-G±«- 


Val 








130 




1 -> IT 










-±44 






Gin 


Arg 


Aj~g- 


His 


-Se-r- 


-Leu- 


Val 


— S c r— 


-Phe- 


Val 


Val 




He- 


-Va-1- 








±45- 










150 








-Arg^ 


1 c c 




Pro 




■Pro — A&p 




Phc 


Val 


-€iy- 


Val 




Scr 


Leu 




Leu 




-LOU 












-Aep- 






-Aep- 


170 


Cyo Aop Gly Aop Arg 


Trp 


Arg Glu Cin Ala Ala 


Lcu 




Leu 




Pro 


X 7"3 


i or* 




Aop 




Tyr 

-£■8-5- 




Tyr Aop Ala Cly Thr Aop 


Scr Gly 




Thr 


Phc 


Scr- 


Scr 


Pro 


Aon 


Phc 


-A±a- 


Thr 


-4±e- 










-±£5- 










200 












Pro 


-G±«- 


Aop 


Thr 


Val 


Thr 






Thr 


Scr 


Scr 


Scr 


Pro 


Scr 


-Hie- 


-245- 


Ala 








210 










215 










*■» ~> r» 


Pro 




Aon 




P£e- 


Tyr 


Tyr 


Pro 




Leu 




Ala 


Leu 


Pro 


Pro 






2^5- 










230 


Lyo 










lie Ala Arg Val 




Lcu 


-Vai- 


Arg 


Leu 




Gin 


Scr 




Phc 




Pro 




Ala 


-^40 










Arg 
-24-5- 






Pro Arg Ala 

-J C A 








Pro 




Pro 


Va4- 


LCU 


Pro 


Scr 






Aon 




-Sie- 


Val 




Scr 


-Aia- 


Sor 


-2-5-5 
-Va4~ 










260 


Arg 


Aap 






265 




Aop 






n n r\ 




Pro 




Thr 


Pro ■ 


Leu 








Val 


Scr 


Leu 




Scr 




-err w 








-> TIT 




Aap 


Cyo 




nor* 






Trp 




Scr 


Trp Gly 


Leu 


Cyo 


Gly Gly Hio 






4-L \J \J 




Scr 


-2*5- 


Thr 








-> ort 


-Cyo 


Gly Arg 

one 


Leu Gly Thr 


Lyo 




Arg 




Arg 


Tyr 


¥a±- 


Arg Val 


-Gifl- 


Pro 


Ala 


Aon 


Aon 




Sor 


Pro 


-3-00 




-Giu- 






^0-5- 








310 










315 


Cyo 


Pro 




Leu 


-G4* 


G±«- 


-G±a- 


■Ala 


-G±u- 


Cyo 


Val 


Pro Aop Aon 


Cyo 


-Vai 










-3-24 



30 



40 



3-2-5 3^9- 

(2) INFORMATION FOR SEQ ID NO : 3 ; 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1020 baoo pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : □ ingle 

(D) TOPOLOGY : linear 

10 (ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 ; 

GCCACGAGCG AAGACAGGAG GAACTCGAGC CTCATTGGCC GGCCCCCGGC GCCGGCCTCG 6-& 

15 CGCTTAAATA CCACCTCCGG GCTCTGCCTG CGACCCCACC GCTGCCGCCC GCGCTCCCGC 3r24 

GCTCTCCTCC TCGCCACTCT CGGCGCCCCC GCCCAGCCTC TTGGGCGAGA GTCCATCTGT 2^0- 

TCCCCCAGAG CCCTCCCCAA AT AC AG CATC ACCTTCACGC CCAATTCCAC CCAGACGCCC ^ 

TTCCCCAAGC AGTACCCCCT GTTCCGCCCC CCTGCGCACT GGTCTTCGCT CCTGGGGGCC 3-6-Q 

20 GCGCATAGCT CCGACTACAG CATGTGGAGG AAGAACCAGT ACCTCACTAA CCCCCTGCGC 

G A CTTTG€CC AA CGCGGCCA-GG€€-T- CGGCC CT GATTAAGG-AGATG^AAG A CC A G A G CCCC W 

GCAGCCCCTG GGGCCCCCCG GAGCCATGGG GTGTCGGGGG CTCCTGTGCA GGCTCATGCT 54-0 

GCAGGCCGCC CAGCGCACAG GCGCTTTCGC CCTGCTCCTG ACCCCCCTGA GGCCCCGCCG 600 

ACCATCTCTG CAATGAAGGG CCCTCTGGTG GCCGGCACGG CCATTCCCAA ACAGCCTCCT &&& 

25 CCTTTCCCAA CCTTGCTTCT TACCCCCCCC CGTCTCCCGT CTGCTCTCAC CCTCCTCCTC 

€TG€AGGATA— AAG T C A T CCC GAA€G€T€CA GCTA €T€-T-AA- ATTATGTCTC CT-^ATAAGTT 7-8G 

ATTGCTGCTC CAGGAGATTG TCCTTCATCG TCCAGCGCCC TGGCTCCCAC GTGGTTGCAG 840 

ATACCTCAGA CCTCCTGCTC TAGCCTCTGC TCAGCCCACT CTCCCGAGGG CCCATCCAAC 

CGCGGGCCAC TTGAGAAGTC AATAAATGGG GCGGTTTCGG AACCGTCACT GTTTCCATGT 960 

30 TATGCATCTC TCTGCGTTTG AATAAAGACT ATCTCTGTTG CTCAAAAAAA AAAAAAAAAA 1020 

(2) INFORMATION FOR SEQ ID NO : 4 ; 



(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 200 amino acido 

(B) TYPE : amino acid 



(C) STRANDEDNES S : ainglc 

(D) TOPOLOGY : linear 



40 



(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DE S CRIPTION : SEQ ID NO : 1 



Met Clu Aon Pro Scr Pro Ala Ala Ala Leu Ciy Lyo Ala Leu Cyo Ala 



15 



25 



35 



_i 5 3r0 


Pro 


±5 

Leu Gly Gly Glu 


Lou Lou Lou Ala Thr Lou Gly Ala Ala Cly Gin 








-±±e- 




Scr 


-A±a- 




Ala 


Leu 






Scr 


-±±e- 


Thr 


Phc 




Scr 




Cyo 

1 c 






Arg 




-4-0— 


Ala Lyo 


Tyr 




^5 










Thr 


-A±e- 






-G±fi- 


Tyr 


Pro 


Leu 


Phe 


-Aaeef 


Gly Aon Trp Scr Gin 








Pro Lyo 


















Ala 


-G±«- 




Scr 




Hio 


Scr 




Pro 


Pro 






Trp 




Scr Lou Leu Gly Ala Ala 

TIT 






Scr Aop 














Aon 


-G±a- 




Scr 


Aon 




Tyr 


Scr 


Met 


Trp 


Arg 
-&5— 


Lyo 






Tyr Val 






Gly Leu Arg Aop 


Phc 


Ala 


-G±«- 






-G±tt- 


Ala 




-I±e- 


Lyo 


Glu 


-±±e- 




-Gift 








Arg 
100 


-G±y- 






Trp Ala Leu 








~4-3rO 


Aop 




Scr 


Pro 


Ala 


Ala 




Pro 






Gly Val 


Scr 


Gly Ala 






115 




Pro Gly Ala 




Arg Scr 












Val 


-Gi-n- 


Ala 




Ala 


-A--l--a~ 




-Giy- 


Hio 


J. iU w» 


-Phe- 




P-r-o- 


130 












Gly Cly Arg- 




-4r4-0 


Arg Gly 




-Arg 


Ala 


Ala 


Pro 








-G3h±- 


Ala 


Ala Pro 


Thr 


He 


Scr 


Ala 


Mct 




±45- 






Aop 


Arg 


Gly 

-±5£- 


















Lyo 

-±60 




Pro 


Lou 


Val 


Ala 




Thr 






-Giflr 


Pro 


Pro 


Pro 


-E%e 


G±y- 










-G±y- 




Gly 


He Gly Lyo 








-±■3-5 




Pro 


Thr 


Leu 


Leu 






Pro 




Pro 


Scr 


Ala 


Leu 


Scr 


Leu 








-±SG- 


Leu Arg 


Gly 




Arg Val 
-±«5 














Leu 


Leu 


Leu 


Gln 






Val 


He 




Ala 


Pro 


Ala 


Thr 


Leu 


Aon 






195 




Aop 


Lyo 




-200- 


Pro Lyo 






-£0-5- 










Val 


Scr 


-Leu- 




-i±e- 


-A3ra- 


-Al-a^ 






-P^e- 


-S-e*?- 


Scr 


-Ge-r- 














215 




Pro Gly Aop 


-Gys~ 
-2^0 














Pro 


Thr 




Leu 


Gin He 


Pro 


-G±ft- 


Thr 




Cyo 


Scr 




Gly Leu Ala 




-23-0- 


Trp 






-235- 






Trp 








Leu 




Ala 


Hio 


Scr 


Pro 


Glu 




Ser 






Gly 


Pro 


Leu 


A3?g- 




Cyo 




"> /I c 








Cly Ala 
25-0- 




■Lyo 


Arg 




-255 




G4r«- 




lie 


Aon 




Val 


Scr 


Glu Ala 


Scr 


-¥a±- 


Phc 


Pro 








Lyo 






Gly Ala 






-2^5 










Cyo 


Tyr 




Scr 


Leu 




Val 


lie 




Thr 


He Scr 


Val 


Ala 


Gin 








G4ry- 




-2^5- 


Cyo 






Lyo 


-2£0- 








-2£5 


Lyo 


Lyo 





Lyo Lyo 
^ 



